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We study network configurations tliat provide optimal robustness to random breakdowns for 
networks with a given number of nodes A'' and a given cost — which we take as the average number 
of connections per node (k). We find that the network design that maximizes fc, the fraction of nodes 
that are randomly removed before global connectivity is lost, consists of g = [{{k) — \)/ ^/{k)]^/N 
high degree nodes ( "hubs" ) of degree -y/ {k)N and N — q nodes of degree 1. Also, we show that 1 — /c 
approaches as 1/\/N — faster than any other network configuration including scale- free networks. 
We offer a simple heuristic argument to explain our results. 
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I. INTRODUCTION 

Recently there has been much interest in determining 
network configurations which are robust against various 
types of attacks While there 

have been studies of complex combinations of different 
types of attacks, surprisingly, there has been no focused 
analysis of the the elementary case of robustness against 
simple random breakdowns or attacks. 

We first study simple random networks. Simple net- 
works contain no self loops or multiple edges neither of 
which add to the robustness of a network to random re- 
moval of nodes. For simple random networks we can de- 
termine the optimal network configuration analytically. 
Randomly constructed networks, however, may have dis- 
connected components, so we also consider networks con- 
structed in a way that ensures they consist initially of a 
single cluster of connected nodes. These networks are 
degree correlated. For degree correlated networks, there 
currently exist no closed-form expressions with which we 
can determine fc analytically, so we study them using 
Monte-Carlo simulations. We find that the optimal con- 
figuration for both the randomly constructed and the de- 
gree correlated networks consists of g ~ y/N high degree 
nodes (huh nodes) of degree ^2 ~ and N ~ q nodes 
of degree 1 [leaf nodes). 



II. UNCORRELATED NETWORKS 
A. Theory 

We first treat simple random networks. It is known 
that for any desired random degree distri- 
bution, simple networks can be created only if P{k) = 
for k greater than the structural cutoff 



(1) 



So we must limit our networks to those with maximum 
degree less than y/ {k)N. For networks with this con- 
straint we can use the equation 3, 14] 



/c = l- 



K- 1 



where 



- {k} 



(2) 



(3) 



to determine fc- 

Since we fix (fc), maximizing fc is equivalent to maxi- 
mizing (fc^). We must maximize 



Ks 



h{p)^Y.^^p{k) 



k=l 



under the following constraints 
P{k) > 



J2kP{k) = {k). 



k=l 



(4) 
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We first show that there can be no more than two unique 
values of fc at which P{k) is non-zero if h{P) is to be max- 
imized. Assume that there are m > 2 non-zero values 
P{ki), P(k2), P{ks)...P{km) needed to maximize h{P). 
Using the method of Lagrange multipliers |l5j we can 
write 



dPikj] 



dP{k,) 
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Xikj + X2 







[1 < j < m] 



(9) 



where Ai and A2 are constants. Solving ^ we find at 
most only two unique solutions for the values of kj . 

Analyzing the problem now with only two values ki 
and k2 for which P{k) are non-zero, we find that h{P) is 
maximized when ki and k2 take on the boundary values 



fci = 1 
k2 = Ks. 



and 



{k)-l 1 



P(fci) = 1- 
P(k2) = 

For these values I — fc assumes its minimal value 



(fc)-l 1 



(1 - /c)m 

For large N, 



{k)^W}N 



(l-/c) 



((fc)-l)(l-(fc)7V+/(fc)iV) 

(fc) 1 



((fc)-i) Viv' 
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B. Simulations 

We next perform Monte Carlo simulations to test the 
results found above. We consider the degree distribution 
that represents a network of q hub nodes and N — q leaf 
nodes, 



P(k) 




k = 1 



^ k = k2 



otherwise, 



(14) 



where 



((fc)-l)7V. 



(15) 



Our aim is to find the value of q which maximizes the 
robustness of the network. 

We create networks using the method described in 
Ref. We then randomly delete nodes in the net- 

work and after each node is removed, we calculate k. We 
use the criterion 



K < 2 



(16) 



for loss of global connectivity ^ |^ 0| . When k be- 
comes less than 2 we record the number of nodes re- 
moved up to that point. This process is performed for 



many realizations of random graphs with the degree dis- 
tribution of Eq. H14|l and, for each graph, for many differ- 
ent realizations of the sequence of random node removals. 
The threshold fc is defined as 



(17) 



where (rij.) is the average value of rir. 

In Fig.lIIa), we plot l-/c versus q for TV = 10^, 10^, 10"* 
and 10^ and (fc) — 2. In Fig. 2(b) we plot the location of 
the minima qrain versus N . As expected Qmin scales with 
N as N^-^ and as shown in Fig. 2(c) the minimum values 
oil — fc scale as iV~°-^. 

Also shown in Fig. ^a) are plots for approximations 
to /c, fc^^^ and which we expect to be valid re- 

spectively for high and low values of q. We will use these 
approximations as another way to show that gmin and 
(1 — /c)min scale as found above. The approximations 
are determined as follows: 

(i) When q ^ N (i.e., the network is homogeneous), 
we expect Eq. © to hold, so /^^'sh = 1 _ - 1). 
For general (fc), using the distribution in Eqs. H14|l. 
we find for > q > 1 



jhigh _ ^ _ 



m-m' 



(18) 



(ii) 



As found in [lj|, Eq. (|2J) is not valid for small q. We 
must use an approximation based on the fact that 
for small q the network loses global connectivity 
when all q hi gh degree nodes are removed. To first 
order in 1/g 



1 



J'iOW 1 _ 



(19) 



Equating Eqs. (|18|l and (|19|l we find the value of q at 
which the approximations intersect 



q* = V(fc) - iVN. 



(20) 



From the fact that scales like vN, we conclude that 
all characteristic values including the location of the min- 
imum of (l-/c) scale like \/iV with a prefactor dependent 
on (fc). 

From Eqs. filml and ((SOJ we find for large N, 



1 - /; = 



1 



1 



(21) 



where /* is the value of value of fc where the approxima- 
tions intersect. The scaling of q* and 1 — fc* are shown 
in Figs.njb) and (c). 

We next study the effect of changing (fc). Figs.l^Ja) and 
(b) contain plots of gmin and (1 — /c)min respectively for 
(fc) — 2, 3, and 4. We note that the scaling is independent 
of (fc) with only a change in the prefactor. 
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III. CORRELATED NETWORKS 

In Fig.|3Ia) we show an example of a randomly created 
graph. Note that, because the graph is created randomly, 
there are some disjoint portions of the graph consisting of 
pairs of nodes connected to each other. Thus the network 
does not consist of a single connected component. We 
now study correlated networks which do not have this 
shortcoming by disallowing connections between degree 
one nodes so that the resulting network is a single cluster 
(see Fig. |3Jb) which has the same degree distribution as 
Fig. Eta) ). 

For correlated networks, the criteria for network col- 
lapse is 01 



det{A) = 



(22) 



where A is a matrix containing elements = kej^k + 
qjSi^j with Cj^k the joint probability of the remaining de- 
grees 131 of the two vertices at either end of a randomly 
chosen edges and with the probability of the remain- 
ing degree of a single vertex at the end of a randomly 
chosen edge. 

We create networks having the degree distribution of 
Eq. (|14|l with (fc) = 2 but with the constraint that leaf 
nodes cannot be connected to each other. We proceed 
as for uncorrelated networks except that after removal of 
an edge instead of calculating k we calculate det{A) and 
note the number of nodes removed before det{A) ~ 0. 

In Fig Ufa) we plot 1 - /c versus 9 for TV = 10^, 10^ 
and 10'' [13 ■ We note that the plots are similar to but 
slightly higher than the corresponding plots for the ran- 
dom networks. In Fig. ^h) we plot the values of q at 
which 1 — /c is minimal and see that they scale in a man- 
ner similar to the scaling of the positions of the minima 
for the random networks. 



IV. COMPARISON WITH SCALE-FREE 
NETWORKS 



Scale- free networks with A < 3 are known to be very ro- 
bust against random attack with 1 — /c approaching 
zero as — > 00. Here, we determine the large N behav- 
ior of 1 — /c for scale-free networks for a given value of 
(k) and compare the behavior with that of the optimal 
bimodal network. 

We consider a scale- free degree distribution -P(fc) 
with m< k < K. For large K and 2 < A < 3 



2- A 

3- A 



(23) 



Substituting in Eq. (O and setting K = Kg we find that 
for large K 



Only in the limit of A approaching 2, does 1 — fc ^ N 
similar to Eq. lO. For A < 2, 



2- A 
3^^ 



-K 



N. 



and 



(25) 



(26) 



but for A < 2, (fc) diverges with increasing K. Thus for a 
given value of (k), 1 — fc for the optimal bimodal network 
always approaches faster than the optimal scale-free 
network [20j. 

For completeness, to ensure that large variance is not 
a deficiency of the optimal network, we now study how 
the variance in fc of the optimal network compares with 
the variance of the scale-free networks . Specifically in 
Fig. 0] we plot the standard deviation 



N 



(27) 



3~A 



(24) 



vs N for the optimal bimodal network with (fc) = 2 and 
for a scale-free network with A = 2. For the scale- free 
network with A = 2, 1 — /c ^ N''^-^ although it has a 
large value of (fc) . We see that the standard deviation of 
fc of both networks decreases as N~^-^ with the scale- 
free network having a somewhat smaller prefactor than 
the optimal network. Thus the variance of fc is not a 
deficiency of the optimal network. 



V. HEURISTIC ARGUMENT FOR OPTIMAL 
CONFIGURATION 

We now provide a heuristic argument for the optimal 
configuration which applies to random or correlated net- 
works. As shown above, the configuration consists of 
q ^ ^/N high degree nodes (hubs) of degree a/ (fc)A^ and 
N — q nodes of degree 1. Intuitively, we suspect that 
the optimal configuration is one in which there are many 
leaf nodes with degree 1 connected to a network core 
composed of a much smaller number of highly connected 
hubs node. Removing a leaf node has only a minimal im- 
pact on the connectivity of the network while removing a 
hub has a much greater impact — but is much less prob- 
able. It is not obvious, however, how many hubs there 
should be. One might initially suppose that the most 
robust network would be a single hub node connected to 
all the remaining nodes (a star network). It is easy to 
show T^, however, that /c = 1/2 for this network which 
is far from optimal. To determine the number of hubs 
we proceed as follows. Consider first that there are {k)N 
connections available to construct the network. Let q 
denote the number of hubs. The number of connections 
needed to connect the hubs to the leaf nodes is 2{N — q). 
If we then make the argument that we want the hubs to 
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form a complete graph using the remaining connections 
we have 

qiq-l) = {k)N-2{N^q). (28) 

Solving for q for large N we have 

q^ ^/{k)-2-VN (29) 

and we again find that the number of hubs scales as ^/N 
in a manner similar to that implied by Eq. (|irb|l . 

for the optimal network with a different prefactor. 



modal degree distributions, with q ~ \/iV high degree 
nodes (hubs) and N—q nodes of degree 1, are most robust 
to random breakdown. Also we have shown that 1 — /c 
approaches as 1/\/N, faster than any other network 
configuration including scale free networks. Finally, we 
have offered a simple heuristic argument which explains 
these results. 
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VI. DISCUSSION AND SUMMARY 

We have shown analytically and confirmed numerically 
using Monte Carlo simulations that networks with bi- 
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FIG. 1; For (fc) = 2 and for (from left to right) N = 
10^,10^,10* and for random networks only 10^ (a) 1 — /c 
vs. number of hubs q. The solid and dotted lines represent 
uncorrelated and correlated networks respectively. Dashed 
lines(short) are approximation /c°™; dashed lines(long) are 
approximation /^*®''. (b) Values of q, gmin, at which 1 — /c 
is minimal vs. A''. Squares and triangles represent uncorre- 
lated and correlated networks respectively; circles represent 
q* the value of q at which the approximations fc'^'^ and 
intersect, (c) Minimum values of 1 — /c versus A^. Squares 
and triangles represent uncorrelated and correlated networks 
respectively; circles represent the values of (1 — fc) at q — q* . 
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FIG. 2: (a) Values of q, q-mi-a, at which 1 — /c is minimal 
vs. A*'. Squares, triangles and circles represent network with 
(fc) = 2, 3, and 4 respectively (b) Minimum values of 1 — /c 
versus A'^. Squares, triangles and circles represent networks 
with {k) = 2, 3, and 4 respectively 
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FIG. 3: Examples of 100 node networks with degree distri- 
bution given by Eqs. (I14II with (k) — 2. (a) uncorrelated 
network. Note that there are disconnected pairs of nodes of 
degree 1. (b) correlated network in which each degree 1 node 
is connected to a high degree node. 




FIG. 4: Standard deviation in fc vs. A'^. Squares represent 
optimal bimodal configuration for (k) — 2. Triangles repre- 
sent scale-free configuration with A = 2 



